Search Result Clustering Method at NTCIR-5 Web Query Expansion Subtask
نویسندگان
چکیده
We use a retrieval system with search result clustering to tackle the NTCIR-5 WEB Query Term Expansion Subtask. The system clusters the search results in such a way as to make it easier for the user to select relevant documents as feedback documents. In addition, we select phrase words or named entities(NE) as query-expansion keywords from the feedback documents because these words tend to represent the characteristics of feedback documents and can retrieve relevant documents that were not retrieved by the initial keywords. Based on our evaluations, we report the efficiency of keyword expansion and the number of relevant documents in the feedback documents.
منابع مشابه
RUCIR at NTCIR-12 IMINE-2 Task
In this paper, we present our participation in the Query Understanding subtask and the Vertical Incorporating subtask of the NTCIR-12 IMine-2 task, for both English and Chinese topics. In the Query Understanding subtask, we combine the extracted candidates from search engine suggestions and Wikipeida, and classify their verticals after clustering and ranking them. In the Vertical Incorporating ...
متن کاملSearch Intent Mining by Word Vectors Clustering at NTCIR-IMine
This paper presents a method for intent mining based on semantic vectors and search results clustering. Our algorithm represent words as documents and performs a state-of-theart approach for query log driven clustering. Similarities between query logs and words are calculated by using semantic vectors. Based on a manual selection of vertical representatives, our method is able to correctly iden...
متن کاملHITSZ-ICRC at NTCIR-11 Temporalia Task
* Corresponding Author ABSTRACT Temporal Information Access (Temporalia) task is a pilot task at NTCIR-11 for the first year. HITSZ-ICRC group participated in Temporalia task, worked in both Temporal Query Intent Classification (TQIC) subtask and Temporal Information Retrieval (TIR) subtask. In TQIC subtask, firstly, we extracted different linguistic level features from user query, extracted ex...
متن کاملOverview of the NTCIR-5 WEB Query Term Expansion Subtask
The query term expansion subtask was conducted to establish an evaluation framework for information retrieval (IR) systems that focus on the effectiveness of query term expansion techniques. However, the quality of query term expansions are affected by several factors (e.g., IR system using expanded query, quality of initial query, etc.), so it is difficult to evaluate this technique. In this s...
متن کاملNTCIR-5 Query Expansion Experiments using Term Dependence Models
This paper reports the results of our experiments performed for the Query Term Expansion Subtask, a subtask of the WEB Task, at the Fifth NTCIR Workshop, and the results of our further experiments. In this paper we mainly investigated: (i) the effectiveness of query formulation by composing or decomposing compound words and phrases of the Japanese language, which is based on a theoretical frame...
متن کامل